Unsupervised Analysis for Decipherment Problems

Authors

  • Kevin Knight
  • Anish Nair
  • Nishit Rathod
  • Kenji Yamada
Abstract

We study a number of natural language decipherment problems using unsupervised learning. These include letter substitution ciphers, character code conversion, phonetic decipherment, and word-based ciphers with relevance to machine translation. Straightforward unsupervised learning techniques most often fail on the first try, so we describe techniques for understanding errors and significantly increasing performance.

Similar resources

Unsupervised Analysis of Structured Human Artifacts

Unsupervised Analysis of Structured Human Artifacts, by Taylor Berg-Kirkpatrick. Doctor of Philosophy in Computer Science, University of California, Berkeley. Professor Dan Klein, Chair. The presence of hidden structure in human data, including natural language but also sources like music, historical documents, and other complex artifacts, makes this data extremely difficult to analyze. In this thesis...

Comparison school bonding and interpersonal problems in students with unsupervised and abused families with normal

This study aimed to compare school bonding and interpersonal problems between students from unsupervised or abused families and students from normal families in Bandar Lengeh. The sample consisted of 152 normal students and 81 unsupervised or abused students. Normal students were selected by the multi-stage cluster sampling method. Data were collected through two questionnaires: school bonding (Rezaei Shari...

Exploiting Machine Learning Techniques to Perform Side Channel Attack

This paper proposes a novel unsupervised learning approach for Power Analysis, a form of side channel attack in cryptanalysis. Unlike existing works that rely on a supervised learning framework to solve this problem, our method does not require any labeled pairs containing {X,Y}={key, power-trace} information, but is still capable of deciphering the secret key accurately. Besides pro...

UNRAVEL - A Decipherment Toolkit

In this paper we present the UNRAVEL toolkit. It implements many of the recently published works on decipherment, including decipherment for deterministic ciphers such as the ZODIAC-408 cipher and Part two of the BEALE ciphers, as well as decipherment of probabilistic ciphers and unsupervised training for machine translation. It also includes data and example configuration files so that the p...

Unsupervised Consonant-Vowel Prediction over Hundreds of Languages

In this paper, we present a solution to one aspect of the decipherment task: the prediction of consonants and vowels for an unknown language and alphabet. Adopting a classical Bayesian perspective, we perform posterior inference over hundreds of languages, leveraging knowledge of known languages and alphabets to uncover general linguistic patterns of typologically coherent language clusters. W...

Publication date: 2006